oV 2 and Yersinia pastis) based on 3 mer words. The frequency

3-mer words demonstrated a significant difference between three

s.

3-mers pattern for MT042778 (M), AB889999 (A) and QANK01002681 (Q).

7.20 shows the correlation coefficients, where it can be seen that

lation coefficient between the 3-mer vector (ܠௌ஺ோௌି஼௢௏) of the

oV and the 3-mer vector (ܠௌ஺ோௌି஼௢௏ିଶ) of the SARS-CoV-2

s was the greatest, being 0.868. The correlation coefficient

of ܠௌ஺ோௌି஼௢௏ and the 3-mer vector (ܠ௒௘௥௦௜௡௜௔) of the Yersinia

as small, being 0.192. The correlation coefficient between

௏ିଶ and ܠ௒௘௥௦௜௡௜௔ was the least, being 0.079.

Correlation coefficients for 3-mer frequency of MT042778, AB889999 and

02681.

MT042778

AB889999

QANK01002681

MT042778

1.000

0.868

0.079

AB889999

0.868

1.000

0.192

QANK01002681

0.079

0.192

1.000

erarchical cluster model was constructed for this 3-mer data

d from three sequences. The model shows that MT042778 (M for

oV-2) and AB889999 (A for SARS-CoV) were merged first

ANK01002681 (Q for Yersinia pastis) was merged with the

f MT042778 and AB889999 with a larger distance. Figure 7.9